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Title: A method for deteTTTi jjaAjag^ a mimo tope sequence ^ 

The invention relates to a method for determiriii 
Lmotope sequence 

Nowadays, random diversity libraries are wid^.y used 
to identify lead molecules for diagnostics, pharma^ .and 
vacciflg. When the lead molecule is a peptide, th^methods 
used f o^W identif yinq the lead molecules are oftien referred to 
as peps 

Pep^can methods have been known sin^e the early 
eighties. Th^basic theory behind these methods is described 
in EP-A-0 138 

Two pivdral aspects of screenMg of random diversity 
libraries are theitf^size, i.e. how nipfny different compounds 
are required to identify a lead mc^lfecule, and the method of 
optimization of the structure or^equence of the lead 
molecule. Most methods detecribga in the literature seem to be 
based on the idea that the\a/ger the random diversity 
library, the higher the chaj^^ of finding a good lead 
molecule . 

The aspect with j^spect\o which the known methods 
are most divergent, is^che methodVof optimization of the lead 
molecule. Once the random diversit}\library has been 
synthesized, it can^be tested for itsWdesired activity. Of 
course it is possjrble to simply choose\s a lead molecule the 
member of the random diversity library tBfe£t scores the 
highest valueifor said desired activity. Htewever, most 
methods comn^rise . steps for optimizing the sotucture of said 
member tha^t shows the hiqhest activity, in oi\er to arrive at 
a lead rrySlecule that shows an even hiqher activity, 

Minipepscan libraries, composed of only \ few 
thou^knd peptides, have been used to identify leadk molecules 
:h were quickly optimized to molecules having am activity 
.lar to that of molecules arrived at from startiiSs? from 
raries composed of millions of compounds. This has-been 
disclosed in Slootstra et al . , Molecular Diversity, 1 \l995b) 
87-96, and in Slootstra et al . , J. Mol . Recogn. , 10, 217^224 
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(1997) . In these articles, it has been shown that small 
libraries in combination with pepscan-based optimization 
methods are a valuable tool in identifying and optimizing 
lead molecules. 

5 The present invention aims to provide an improved 

method for identifying and optimizing a lead molecule. 

It has been found that the objective improvement is 
achieved by starting from a library of known test molecules, 
evaluating the activity of said library, and by optimizing 
10 the structure of a few of test molecules showing the highest 
activity, by methodically replacing each building block of 
the structure of the test molecules with all other possible 
building blocks. 

Thus, the invention relates to a method for 
15 determining a mimotope sequence for a receptor comprising the 
steps of : 

a) providing a library of test sequences; 

b) determining the activity of each test sequence of the 
library towards the receptor; 

20 c) identifying a test sequence comprising at a certain 

position a building block which, according to the results 
of step b) , is favored at said position; 

d) providing a next library of test sequences, based on said 
test sequence identified in step c) , by replacing a 

25 building block at selected positions of the identified 

test sequence with selected building blocks; 

e) determining the activity of each test sequence of the 
library provided in step d) towards the receptor; 

f ) identifying a test sequence comprising at a certain 

30 position a building block which, according to the results 

of step e) , is favored at said position; 

g ) repeating steps d) - f ) for the library of test sequences 
provided in step d) , for a number of cycles sufficient for 
finding in step f) a mimotope sequence that gives 

35 sufficient activity towards the receptor. 
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It has been found that the present method leads, in a 
convenient manner, to a mimotope sequence having a very high 
binding strength to the desired receptor- In some cases, 
where the exact epitope (i.e. the partial sequence of an 
antigen, that provides binding to an antibody) is known, the 
present method has been found to lead to the exact structure 
of the epitope in only a few steps. 

According to the invention, a mimotope sequence for a 
receptor is determined. In this context, a mimotope sequence 
is defined as a molecule that shows a certain, minimal, 
desired activity in the presence of a given receptor. An 
example hereof is the determination of the epitope sequence 
of an antigen for a certain antibody. However, in some cases 
it may be sufficient if the exact epitope is mimicked and 
that a slightly less binding strength of the found molecule 
to the receptor suffices. 

The mimotope sequence will be a molecule that is 
composed of a number of building blocks, wherein the number 
and order of building blocks controls the properties of the 
molecule. Examples of types of mimotope sequences that may be 
determined according to the invention include peptides, 
possibly having steroid or saccharide -like structures 
connected thereto, saccharides, DNA (oligonucleotides), and 
PNA (peptide-like nucleic acids). Thus, the building blocks 
will be chosen from the groups of amino acids (both natural 
and non-natural amino acids) ,. monosaccharides, and 

nucleotides . 

The receptor for which a mimotope sequences is 
determined according to the invention may be any compound, 
composition, microorganism, or tissue sample towards which 
one of the types of mimotope sequences may show some activity 
that can be measured, such as a binding activity. Suitable 
examples of receptors include antibodies (both monoclonal and 
polyclonal), proteins, such as enzymes, cells, hormone 
receptors, and micro-organisms. 
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The first step in a method according to the invention 
is the provision of a library of test sequences. Of course, 
these test sequences are composed of the same type or class 
of building blocks as the objective mimotope sequence. 
Preferably, the test sequences are known in that, although 
they may be randomly chosen, their composition and structure 
is known. This may be accomplished by generating, for 
instance by hand or by computer, a number of sequences and 
synthesizing the sequences thus obtained. It is also possible 
to derive the library of test sequences from a compound which 
is known to have a favorable activity towards the receptor. 
In principle, any known manner of selecting test sequences to 
make up the library is suitable. 

The length of the test sequences is dependent on the 
nature of the building blocks constituting the sequences, and 
on the nature of the receptor and the desired activity 
towards said receptor.. For most purposes, a length of between 
3 and 20 building blocks will be suitable. 

The number of test sequences constituting the library 
will be large enough to provide sufficient data to come to a 
good mimotope sequence in an acceptable number of 
steps/cycles. On the other hand, said number will be small 
enough to assure that the data obtained can be handled quite 
conveniently. Usually, the number of test sequences in the 
library will lie between 500 and 100,000, preferably between 
1,000 and 10,000. 

After the desired number of test sequences is 
generated, said test sequences may be synthesized. This may 
be done in any known manner, for instance as has been 
described by Slootstra et al . in Molecular Diversity, 1 
(1995b), 87-96. Of course, it is also possible to make use of 
a library which is already available, for instance because it 
has been used in a previous run of the present method. In 
that case, the test sequences will not have to be 
synthesized. 
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For reasons of convenience, the test sequences are 
preferably synthesized using a minicard or a flat support 
medium as has been described in Dutch patent application 
10.019703. 

5 The next step of a method according to the invention 

is the determination of the activity of each test sequence of 
the library towards the receptor. The manner wherein this 
determination is carried out will depend on the specific 
interaction between mimotope sequence and receptor that is 
10 aimed at, and on the nature of the receptor and the building 
;p " blocks of the mimotope sequence. For instance, when the 

s £ desired activity is a binding of the mimotope sequence to the 

;7y receptor, and the mimotope sequence is a peptide and the 

IN" receptor is a monoclonal antibody, the determination may 

15 suitable be performed in an ELISA test, either in solution or 
Sf on a solid support. Other suitable methods of determining the 

L activity include BIACORE and AFM (Atomic Force Microscope) . 

5 The skilled person will be able to choose a suitable manner 

^ of determination of the activity, given a certain receptor 

p 2 0 and nature of the mimotope sequence. 

From the results of the determination of the 
activities of the test sequences of the library, at least one 
test sequence will be chosen to form a basis for the 
remaining steps of the present method. Said at least one test 

2 5 sequence is, in accordance with the invention, chosen by 
identifying at a certain position a building block which, as 
appears from the results of the determination of the 
activity, is favored at said position. In this regard, by the 
phrase "at a certain position a building block which is 

3 0 favored at said position", is meant that test sequences having 
at said position said building block show a high activity 
towards the receptor, relative to other activities found. 

It is possible that the at least one test sequence 
which is identified was tested for its activity towards the 
35 receptor and showed itself a high activity, relative to other 
activities found. 
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It is also possible that the at least one test 
sequence which is identified, was in itself not tested- This 
can be explained as follows. For instance, it may have been 
found that a test sequence having at position 2 building 
block A shows a very high activity towards the receptor. It 
may further have been found that a test sequence having at 
position 4 building block B also shows a very high activity 
towards the receptor. The at least one test sequence to be 
identified may then be a sequence comprising building block A 
at position 2 and building block B at position 4. However, it 
may -very well be that this sequence itself had not yet been 
evaluated for its activity towards the receptor. 

Generally, the number of test sequences identified 
for basing the remaining steps of a method according to the 
invention on will be between 1 and 150. A higher number of 
test sequences chosen will lead to better results, but will 
be relatively more cumbersome to handle. Preferably, said 
number will be between 1 and .25. The skilled person will be 
able to decide on the number of test sequences chosen, 
dependent on the desired quality of the result of the method 
and the facilities available for carrying out the method. 

On the basis of the identified test sequences, a next 
library of test sequences is provided in a so-called 
replacement analysis. This is done by varying the building 
blocks at positions of the identified test sequences, i.e. by 
replacing a building block at selected positions of the 
identified test sequence with selected building blocks. 

For instance, in its simplest form the replacement 
analysis is carried out as follows. In case the test 
sequences are dodecapept ides of natural amino acids, first, 
the building block at position one of one of the chosen 
dodecapept ides may changed into each of the twenty naturally 
occurring amino acids, leading to 2 0 new test sequences. This 
is repeated for each position of the dodecapept ide , leading 
to 20 * 12 = 240 test sequences. Among these 240 sequences, 
the original dodecapeptide will be present 12 times. This 
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procedure is repeated for each chosen dodecapeptide . Thus, a 
next library is obtained which comprises from 240 to 3600 new 
test sequences . 

More complex forms of this replacement analysis may 
5 be performed, for instance by allowing certain building 

blocks to be replaced by groups of building blocks (e.g. the 
replacement of one amino acid by two or more amino acids) . 
Usually, such a replacement by multiple building blocks will 
be used sparingly. Preferably, in at least one replacement 
_ 10 analysis of the different cycles of the present method, at- 

□ r least one building block is replaced by a group of building 

W blocks. Care should be taken that in each replacement 

^ " analysis not too many multiple building blocks are 

5 introduced, for then problems concerning the length of the 

!H 15 sequence may arise. For the same reason, the multiple group 

of building blocks should preferably not be too long. The 
D skilled person will, based on his experience in the field, be 

jf{ able to judge which size and number of multiple building 

H> blocks may be introduced. 

It is also possible to replace a building block by a 
void in the replacement analysis. That way, the sequence will 
become one building block shorter. It will be clear that not 
too many building blocks should be replaced by a void at the 
same time, as this could lead to an undesirably short (or no) 
25 test sequence. The skilled person will be able to judge when 
and at which position it is helpful to introduce a void into 
a test sequence . 

In addition, it is possible to allow the introduction 
of selected building blocks in the replacement analysis only, 
3 0 or to only replace building blocks at selected positions in 
the test sequences. Sometimes it is preferred to maintain 
certain building blocks at certain positions, e.g. in order 
to maintain a desired three dimensional structure of the test 
sequence. An advantage of this manner of performing the 
35 replacement analysis is that the total number of test 

sequences to be provided is limited. Generally, however, if 
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the building blocks at all positions of the test sequence are 
replaced by all possible building blocks, the test sequences 
having the desired three dimensional structure will also be 
provided. Again, it is within the skill of the artisan to 
5 judge in which cases it may be helpful to carry out a 

replacement at selected positions only. He will also be able 
to judge to which positions this may apply, and which 
building blocks may be used for the replacement and which 
building blocks may best not be used in the replacement (at 
10 certain positions) . 

The test sequences of the next library, which 
provided on the basis of the replacement analysis of the at 
least one identified test sequence, are tested for their 
S 1 activity towards the receptor. In case a test sequence is 

tjj 15 present among the test sequences of this library that shows 

2 sufficient activity towards the receptor, the method is 

3 completed and the desired mimotope sequence is obtained. It 
m will be clear that it will depend on the type of activity, 
[Z the receptor, the mimotope sequence, and the objective 

M= 2 0 application of the mimotope sequence, whether a certain 

activity is regarded sufficient. Given the circumstances, the 
skilled person will be able to choose a threshold for a 
desired activity. 

In case none of the test sequences shows sufficient 

25 activity towards the receptor, another cycle is carried out . 
In accordance with the invention, preferably at least two 
cycles, and thus two replacement analyses, are carried out. 
Thus, a next library of test sequences is provided, based on 
the 10 to 15 test sequences of the previous library that 

30 showed the highest activity, as set forth above. The test 
sequences in this next library will be evaluated for their 
activity, and so on. 

The number of cycles that has to be carried out will 
depend on the desired degree of activity of the mimotope 

35 sequence to be found for the receptor. Starting from about 
4 50 0 randomly chosen dodecapeptides , it has been found 
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possible to arrive at the exact epitope sequence for a 
monoclonal antibody in less than three cycles. 

in a preferred embodiment of the invention, the 
amount of receptor used for determining the activity of the 
test sequences towards said receptor, is lowered in each 
cycle. It has been found that building blocks that seem 
essential in the first cycle may become non-essential in 
further cycles. In other words, in said first cycle local 
optima may be found, which may be overcome in a subsequent 
cycle or in subsequent cycles. The lowering of the amount of 
the receptor in the determining of the activity of the test 
sequences has been found to assist in overcoming such local 
optima. Preferably, the amount of receptor used in the 
determination of the activities of test sequences in a 
library is reduced by a factor of from 50 to 1000, more 
preferably from 10 to 100. 

The invention will now be further elucidated by the 
following, non-restrictive examples. 

EXAMPLES 

The described examples illustrate six variations of 
the present optimization method that has been successfully 
applied to lead peptides, identified with antibodies 
(monoclonal and polyclonal) raised against peptides, 
proteins, viruses, bacteria, sugars and steroids, from 
various types of peptide libraries.. It can be envisaged that 
additional variations are possible. The concept that rules 
these variations is the repetition of replacement analyses on 
1 or more lead peptides. Local minima are overcome and 
epitopes and mimotopes are identified. 
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Materials and Methods 

Synthesis and screening minicards libraries (used in examples 
1, 2, 5, 6) 

Using the 20 natural L-amino acids 4550 random 
dodecapeptides (12-mers) were generated with a random 
generator programmed in Quick basic which runs on a 486 DX2 
(66 MHz) computer system. In this library the frequency of 
each residue is approximately 5%. This set of sequences was 
used to design additional libraries. One in which all 
sequences are in the D-amino acid form, one in which the 2nd 
and 11th position are held by a cysteine and one in which the 
3rd and 10th position are held by a cysteine. The aim of the 
latter two is to introduce a disulphide bridge into the 
peptide which should present the peptides as loops. Very 
important here is that the sequence of the amino acids within 
and outside the cysteines is identical to that of the first 
set of 4550 dodecapeptides. In this way the activity of these 
sets of dodecapeptides can be compared. Example- 1 describes 
results obtained with the synthetic peptide library one in 
which the 3rd and 10th position are held by a cysteine. 
Example-2 describes results obtained with the synthetic 
peptide library composed of 4550 random dodecapeptides (12- 
mers) , i.e. no cysteine or any other motif was used. 

The libraries were synthesized and screened using 
credit-card format mini-PEPSCAN cards (455 peptides/card) as 
described previously (Slootstra et al . , 1995b). In example-1 
the binding of monoclonal antibody 2 6/9 to each peptide was 
tested in a PEPSCAN-based enzyme -linked immuno assay (ELISA) . 
Monoclonal antibody 26/9 has been described previously 
(Wilson et al. , 1984; Rini et al . , 1992; Churchill et al., 
1994) . Monoclonal antibody 26/9 was raised against the 
peptide HA175-110 of the hemagglutinin protein of influenza 
virus (X47:HA1) (Wilson et al . , 1984). 

In example-1 the 455 -well creditcard- format 
polyethylene cards, containing the covalently linked 
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peptides, were incubated with antibody 26/9 (100 ^g/ml) . 
After washing the peptides were incubated with rabbit-anti- 
mouse peroxidase (rampo, dilution 1/1000) (Dakopatts) (1 hr. 
2 5oC) , and subsequently, after washing the peroxidase 
substrate 2 , 2 ' -azino-di-3-ethylbenzthiazoline sulfonate 
(ABTS) and 2 M l/ml 3% H202 were added. After 1 hr the color 
development was measured. The color development of the ELISA 
was quantified with a CCD-camera and an image processing 
system. The setup consists of a CCD-camera and a 55 mm lens 
(Sony CCD Video Camara XC-77RR, Nikon micro-nikkor 55 mm 
f/2 8 lens), a camara adaptor (Sony Camara adaptor DC-77RR) 
and the Image Processing Software package TIM, version 3.36 
(Difa Measuring Systems, The Netherlands) . TIM runs on a 486 
DX2 (5 0 MHz) computer system. 

The dodecapeptide library, composed of 4550 random 
dodecapeptides, is screened with relatively high 
concentration of antibody (100 M g/ml, more usually 10 /ig/ml. 
see examples 2-6) . This concentration is approximately 2 
orders of magnitude above the concentration required to 
obtain maximal binding activity in ELISA with the native 
epitope peptide (not shown) . The concentration of 100 M g/inl 
was used to obtain many binding peptides from the small set 
of 4550 random dodecapeptides. In previous studies it was 
shown that such a strategy, without obtaining an unfavorable 
signal to noise ratio, can result in hundreds of binding 
peptides that resemble small linear or non-linear parts of 
the native epitope (Slootstra et al . , 1995b; Slootstra et 
al . , 1997) . 

Synthesis and screening of OTHER pepscan libraries (used in 
example 4) 

in addition to random minipepscan libraries mimotopes 
can also be identified from standard pepscan libraries. These 
libraries contain all overlapping 12-mers (or shorter/longer) 
covering the linear sequence of a known protein (Slootstra et 
al . , 1995a) . 
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Synthesis and screening of non-pepscan libraries (used in 
examples 3, 6) 

Lead peptides can also be derived from other type of 
libraries such as for example phage-display libraries . 

Sequence analysis of lead molecules (used in example-5) 

All 4550 dodecapeptides were ranked according to 
their binding activity. According to their ranking consensus 
sequences and motifs are identified. Initial methods of lead 
optimization that only use the sequence of the top 50 
molecules have been described in Slootstra et al . (1995b; 
1997) . The present method uses all 4550 sequences. Firstly, 
the frequency and/or distribution of single amino acids and 
the dipeptide motifs OO, dipeptide motifs OXO and dipeptide 
motifs OXXO is determined using Microsoft Excel 4.0 (O, one 
of the 20 natural L-amino acids; X, any residue) . Secondly, 
properties of the amino acids ( hydrophobic ity, charge etc.) 
are included in the analysis. Thirdly, all this data is used 
to optimize the activity of the top 10-50 lead molecules. 
This is done by substitution of building blocks that inhibit 
activity and include building blocks that improve binding 
activity . 

The present sequence analysis method improves the 
activity of lead molecules through motif analysis of all 
sequences part of the library. 

Replacement analyses (used in examples 1, 2, 3, 4, 5, 6) 

In a replacement analysis the binding activity of 
complete series of substitutions analogs of lead peptides, in 
which each position is replaced by each of the other 19 
natural L-amino acids, is investigated in detail. In 
addition, it is possible to use the 20 D-amino acids as well 
as other available non-natural amino acids. Substitutions 
that result in improved binding are combined into new 
sequences of building blocks. These new sequences are again 
tested in a 2nd replacement analysis. If necessary more 
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rounds of replacement analyses are performed. After each 
round of a replacement analyses the antibody (or other 
soluble receptor) concentration can be lowered to obtain 
maximal binding activity (e.g. from 100 M g/ml to 0.01 jig/ml 
after 2 rounds of replacement - analyses , as shown in example- 

1) . 
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Notes for the examples: 

- the 10 /xg/ml, 1 /xg/ml etc. denote the antibody 
concentration. 

- The line 11 improved posit ion- 01 " etc. means that 
this change in amino acid (is underlined) improves binding 
activity in pepscan elisa at given antibody concentration. 
Positions that could not be improved are not mentioned (e.g. 
position- 08 in first replacement -net of example- 1, i.e. the 
original I at position 8 gives the highest activity) . 

Example- 1 : 

title: Identification of an epitope through a lead peptide 
selected from 4550 random dodecapeptides (minipepscan library 
with motif XXCXXXXXXCXX , X , randomly selected amino acid, 
cf . Materials and Method) ) 

tools: lead peptide GCGAAMNIRCYA 

methodology: two rounds of replacement analyses 
Results: 

In example- 1, the lead peptide CGCAAMNIRCYA was 
derived (with antibody 26/9) from the random library in which 
the 3rd and 10th position are held by a cysteine (see above 
in Materials and methods) . All possible single substitution 
analogs of the random peptide CGCAAMNIRCYA tested in pepscan 
for binding antibody 26/9 at 100 fig/ml . Below these results 
are described in detail . 

The replacement analysis of the lead peptide 
CGCAAMNIRCYA resulted in the identification of building 
blocks that cannot be replaced by any other building blocks 
(e.g. Y and A) , and to the identification of residues that 
can be replaced by one or two other building blocks (e.g. N) , 
and to residues that can be replaced by many other building 
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blocks (e.g. M) . Some replacements improve binding activity 
considerably (e.g. N into D) . 

All these replacements were used to design the 
improved peptide EMDEEEDIMNYA . Note that this peptide binds 
5 antibody 26/9 at much lower concentration, i.e. has improved 
binding activity (100 /zg/ml for CGCAAMNIRCYA and 1.0 fig/ml 

for EMDEEEDIMNYA) . 

The peptide EMDEEEDIMNYA was run through a second 
replacement analysis. Again some replacements improve binding 

10 activity considerably. These replacements were used to design 
the -improved peptide EMDEEEDVPDYA . Essential is that the 
first part of EMDEEEDIMNYA, EMDEEE, does not contain critical 
residues whereas the latter part, DIMNYA, does. Combination 
of the improved residues in this latter part results in the 

15 sequence DVPDYA . The sequence DVPDYA is identical to the 
linear epitope of antibody 26/9. Thus, the lead peptide 
CGCAAMNIRCYA derived from a few thousand random 
dodecapeptides was turned into native epitope sequence 
through two replacement analyses. 

20 

REPLACEMENT ANALYSIS- 1 

original lead: CGCAAMNIRCYA 

activity at > 100.0 M9/ ml 
improved position-01: EGCAAMNIRCYA 
25 activity at > 10.0 /xg/ml 

improved position-02: CMCAAMN I RC YA 

activity at > 10.0 |ig/ml 
improved position-03: CGDAAMN I RC Y A 

activity at > 10.0 tig/ml 
30 improved position-04: CGCEAMNI RCYA 

activity at > 10.0 fig/ml 
improved position-05: CGCAEMNI RCYA 

activity at > 10.0 /xg/ ml 
improved position-06: CGCAAENIRCYA 
35 activity at > 10.0 /ig/ml 

improved position-07: CGCAAMDIRCYA 
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activity at > 10.0 jzg/ml 
improved position-09 : CGCAAMNIMCYA 

activity at* > 10.0 /xg/ml 
improved position-10: CGCAAMNIRNYA 

5 activity at > 10.0 /zg/ml 
combination improvements: EMDEEED I MNYA 

activity at > 1 . 0 pg /ml 

REPLACEMENT ANALYSIS-II 
10 combinations rep-an. I: EMDEEEDIMNYA 

activity at > 1.0 fig/ml 
improved position-08: EMDEEEDVMMYA 

activity at > 0 . 1 [ig/ml 
improved position-09: EMDEEEDIPNYA 
15 activity at > 0 . 1 ixg/ml 

improved position-10: EMDEEED I MDYA 

activity at -> 0.1 /ig/ml 

6 combination improvements: EMDEEEDVPDYA 

activity at >. 0.01 //g/ml 



20 



25 



Notes : 

The sequence DVPDYA is the original epitope. The 
peptide EMDEEEDVPDYA has a 10 -fold improved binding affinity 
(in solution) over the native epitope peptide YPYDVPDYASLRS . 



Example -2 : 

title: Identification of a mimotope through a lead peptide 
30 selected from 4550 random dodecapept ides (random minipepscan 
library) 

tools: lead peptide ANWPS AI GAFGL 



3 5 methodology: three rounds of replacement analyses 
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Results : 

In example-2, lead peptides were identified through a 
random minipepscan library. The difference of example-2 with 
example -1 is that the antibody used in example- 1 binds a 
linear epitope whereas the antibody used in example-2 binds a 
non- linear epitope. The following multiple replacement 
analyses were done as discussed in example-1. The identified 
mimotope does not resemble any region of the linear sequence 
of the native protein. 



REPLACEMENT ANALYSIS -I 
original lead: 



improved pos i t i on -01 
improved position- 02 



25 



ANWPSAIGAFGL 
activity at > 10.0 /xg/ml 
HNWP S A I GAFGL 
activity at > 5 . 0 /xg/ml 
AWWPSAIGAFGL 
activity at > 5 . 0 iig/ral 
improved ppsition-03: ANAP S A I GAFGL 

activity at > 5 . 0 /xg/ml 
ANWSSAIGAFGL 
activity at > 5 . 0 M9/ ml 
ANWS SAIGAFKL 
activity at > 5.0 fig/ml 
combination improvements: HWAS SAIGAFKL 

activity at > 1 . 0 fig/ml 



improved position- 04 
improved position - 11 



30 



35 



REPLACEMENT ANALYSIS- 1 1 

combinations rep -an. I: HWAS SAIGAFKL 



improved position- 01 : 
improved position- 02 : 
improved position-03: 
improved position-07: 



activity at > 1 . 0 /zg/ml 

KWAS SAIGAFKL 

activity at > 0 . 5 fig/ml 

HYAS SAIGAFKL 

activity at > 0 . 5 txg/ml 

HWGS SAIGAFKL 

activity at > 0 . 5 /xg/ml 

HWAS S AMGAFKL 
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activity at > 0 . 5 fxg/ml 
combination improvements: KYG SSAMGAFKL 

activity at > 0 . 1 //g/ml 



REPLACEMENT ANALYSIS- III 
combinations rep-an. II: KYGSSAMGAFKL 

activity at > 0 . 1 fig/ml 
improved position-03: KY FS S AMGAFKL 

activity at > 0.05 fxg/ml 
improved position-06: KYGSSGMGAFKL 

activity at > 0.05 /ig/ml 
combination improvements: KYFSSGMGAFKL 

activity at > 0.01 /xg/ml 



Example- 3: 

title: Identification of a mimotope through a lead derived 
20 from phage-display library composed of >1000.000 random 
hexapeptides . 

tools: lead peptide SDTRKG* 

25 methodology: lengthening to the left and to the right of 

SDTRKG with cysteines/glycines, followed by three rounds of 
replacement -analyses 

Results : 

30 In example-3, lead peptides were identified through 

phage-display. Adjacent cysteines/glycines were added to 
improve binding activity. After obtained activity multiple 
replacement analyses were done as discussed in example-1. 
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LENGTHENING BY CYSTEINES/ GLYCINES : 
phage -display lead: SDTRKG 

NO activity at 10.0 /xg/ml 

lengthened lead: CSDTRKGC 

activity at > 10.0 fig/ml 

lengthened lead: CSDTRKGCG 

activity at > 10.0 /xg/ml 



REPLACEMENT ANALYSIS- 1 
lengthened lead: 

improved position-02: 

improved position-03: 

improved posit ion- 05: 

improved position-06: 

combination improv. : 

REPLACEMENT ANALYSIS-II 
comb, rep-an. I: 

improved position-02: 

improved position-05: 

improved position-06: 

combination improv. : 



CSDTRKGCG 

activity at* > 10.0 fig/ml 
CTDTRKGCG 

activity at > 5.0 /xg/ml 
CSETRKGCG 

activity at > 5 . 0 fig/ml 
CSDTHKGCG 

activity at >, 5 . 0 /xg/ml 
CSDTRYGCG 

activity at > 5 . 0 /xg/ml 
CTETHYGCG 

activity at > 1 - 0 /xg/ ml 
CTETHYGCG 

activity at > 1 - 0 /xg/ml 
CYETHYGCG 

activity at > 0 . 5 {xg/ml 
CTETKYGCG 

activity at > 0 . 5 ^g/ml 
CTETHFGCG 

activity at > 0 . 5 /xg/ml 
CYETKFGCG 

activity at > 0 . 1 /xg/ml 
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REPLACEMENT ANALYSIS- III 

comb, rep -an. II: CYETKFGCG 

activity at > 0 . 1 fig/ml 
improved position-01: DYETKFGCG 

activity at > 0.05 fig/ml 
improved position-08: CYETKFGNG 

activity at > 0.05 /zg/ml 
combination improv. : DYETKFGNG 

activity at >, 0.01 /zg/ml 

*, this lead peptide is, as a synthetic peptide, not active 
{not in elisa (pepscan or standard nor in solution) . This is 
not unique. Often phage-peptides are only active as part of 
the phage-coat protein. In other formats they lose their 
activity. 



Example -4 : 

title: Identification of a mimotope through a lead derived 
from a standard pepscan analysis. 

tools: lead peptide RVMIKLILVNFR* and complete sequence of 
native protein (part which was used is KIYRVMIKLILVNFRMQP) . 

methodology: Two rounds of replacement -analyses, followed by 
lengthening to the left and right, again followed by two more 
rounds of replacement -analyses , finally followed by 
lenthening to an 18-mer mimotope. 

Results : 

In example-4, a lead peptide was identified through 
standard pepscan analysis, i.e. the antibody was tested on 
all overlapping 12-mers covering the linear sequence of the 
protein. The following two rounds of replacement analyses 
were done as discussed in example-1. In addition, the 
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mimotope was lengthened to the left and right (3 amino acids, 
this can be further lengthened) through additional 
replacement analyses. 



REPLACEMENT ANALYSIS -I 
original lead: 

improved position- 01: 

improved position-02: 

improved position-03: 

improved position-08: 

improved position-11: 

combination improv. : 

RE PLACEMENT ANALYSIS- II 
comb, rep -an . I: 

improved position- 01 : 

improved position-02 : 

combination improv. : 



RVMIKLILVNFR 
activity at >, 
AVMIKLILVNFR 
activity at >. 
AIMIKLILVNFR 
activity at >, 
AVPIKLILVNFR 
activity at > 
AVMI KLIRVNFR 
activity at >. 
AVMI KL I LVNYR 
activity at > 
AIPIKLIRVNYR 
activity at j> 



AIPIKLIRVNYR 
activity at >. 
YIPIKLIRVNYR 
activity at _> 
APPIKLIRVNYR 
activity at >, 
YPPIKLIRVNYR 
activity at > 



10.0 fJLg/ml 

5.0 /zg/ml 

5.0 /zg/ml 

5.0 /zg/ml 

5 . 0 M9/ ml 

5 . 0 /ig/ml 

1.0 fxg/ral 



1.0 /xg/ml 
0 . 5 /ig/ml 
0.5 /zg/ml 
0 . 1 /ig/ml 



35 



LENTHENING WITH NATIVE SEQUENCE: 
comb, rep-an. II: YPPIKLIRVNYR 

activity at >. 
lengthened left: KIYYPPIKLIRV 

activity at > 
lengthened right: I KL I RVNYRMQP 



0 . l /ig/ml 
1.0 /zg/ml 
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activity at > 1.0 /xg/ ml 



REPLACEMENT ANALYS I S - 1 1 1 of left lengthened peptide: 



lengthened left 



improved posit-01 



improved posit -02 



improved posit-03 



improved posit-08: 



combination impr, 



KIYYPPIKLIRV 
activity at >. 
RIYYPPIKLIRV 
activity at > 
KPYYPPIKLIRV 
activity at >. 
KIWYPPIKLIRV 
activity at >_ 
KIYYPPISLIRV 
activity at >, 
RPW YPPISLIRV 
activity at >. 



1 . 0 fig/ml 

0 . 5 (ig/val 

0 . 5 /xg/ml 

0 . 5 /ig/ml 

0 . 5 /ig/ml 

0 . 1 M9/ ml 



REPLACEMENT ANALYSIS- IV of right lengthened peptide: 



lengthened right : 
improved posit -10: 
improved posit -11: 
improved posit-12: 
combination impr. : 

combination total : 



IKLIRVNYRMQP 

activity at > 1 . 0 ixg/ml 

IKLIRVNYRCQP 

activity at > 0 . 5 /zg/ml 

I KL I RVNYRME P 

activity at > 0 . 5 /ig/ml 

IKLIRVNYRMQN 

activity at > 0 . 5 fig/ml 

I KL I R VNYRCEN 

activity at > 0 . 1 fig/vc\l 

RPWYPPISLIRVNYRCEN 
activity at > 0.01 fig/ml 



*, lead peptide was identified from a library composed of all 
overlapping 12-mers covering the linear sequence of a 
protein. This makes it possible to lengthen the peptide to 
the left and right with adjacent amino acids, in this case 
KIY on the left and MQP on the right. Additional replacement- 
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analyses of 12-mers, shifted three to the left or three to 
the right of the sequence finally results in an 18-mer 
mi mo tope . 



Example -5 : 

title: Identification of a mimotope through a set of similar 
leads selected from 4550 random dodecapeptides (random 
minipepscan library) 

tools: set of 6 lead peptides QNNMKLFRGCVP , RG I KWNEMTDQ W , 
KLQQNPTFYPPV, TNNCKEFAGIVP , RGILTNIMKDQW, IVQNNPKFFRGA 
(potentially up to all 4550 peptides, see material and 
method, in this case also 'negative* amino acids are removed 
from the leads) 

methodology: : determination of consensus sequence, followed 
by two rounds of replacement analyses of the consensus 
sequence 

Results : 

In example-5, a set of similar lead peptides were 
used to identify a consensus sequence. The consensus 
sequences were identified as discussed in materials and 
method. This sequence was used in the replacement analyses. 

ALIGNMENT OF LEAD PEPTIDES: 



QNNM KL FRG CVP 
RGIKWNEMTDQW 
KLQQNPTFYPPV 
TNNCKEFAGIVP 
RGILTNIMKDQW 
IVQNNPKF FRGA 
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consensus : ILONNMKDFRG 



REPLACEMENT ANALYSIS -I 
consensus lead: 

improved position-03: 

improved position-07: 

improved posit ion -10: 

combination improv. : 

REPLACEMENT ANALYSIS- II 
comb, rep-an. I: 

improved position-07: 

improved position-11: 

combination improv. : 



ILQNNMKDFRG 

activity at > 1.0 M9/ ml 
I LTNNMKDFRG 

activity at > 0 . 5 ttg/ral 
ILQNNMPDFRG 

activity at > 0 . 5 /ig/ml 
ILQNNMKDWRG 

activity at > 0 . 5 fig/ml 
ILTNNMPDWRG 

activity at > 0 . 1 fig/ml 



ILTNNMPDWRG 

activity at - > 0.1 Mg/ ml 
ILTNNMGDWRG 

activity at > 0.05 ixg/ml 
I LTNNMPDWYG 

activity at > 0.05 M9/ ml 
ILTNNMGDWYG 

activity at > 0.01 txg/ml 



Example- 6 : 

title: Identification of a mimotope through a set of 
different leads selected from phage-display library composed 
of >1, 000,000 random hexapeptides . 

tools: set of 3 lead peptides (ANWPSA, KLITRW, NVCSWS) 

methodology: Two rounds of replacement analyses (for each 
lead peptide), followed by determination of overall consensus 
sequence . 
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Results : 

In example-6, different lead peptides were used to 
identify a consensus sequence. Each lead peptide was used ii 
multiple replacement analyses (two rounds) . The resulting 
three mimotopes were aligned which resulted in a lengthened 
mimotope with improved activity. 



REPLACEMENT ANALYSIS - I A 
original lead : ANWPSA 

improved position-01: HNWPSA 
improved position- 02: AWWPSA 
improved position-03: ANAPSA 
improved position-04: ANWSSA 
combination improv. : H WAS SA 

REPLACEMENT ANALYSIS- 1 1 A 
comb. rep -an. I A : HWASSA 
improved position- 05 : HWASPA 
combination improv. : HWASPA 

REPLACEMENT ANALYSIS- IB 
original lead : KLITRW 

improved position-01: SLITRW 
improved position-02: KSITRW 
improved position-03: KLATRW 
improved position-06: KLITRY 
combination improv. : SSATRY 

REPLACEMENT ANALYSIS- I IB 
comb. rep -an. IB : SSATRY 
improved position-02: SPATRY 
combination improv. : SPATRY 

REPLACEMENT ANALYSIS -IC 
original lead : NVCSWS 

improved position-02: NXCSWS 



activity 


at 


>10 . 


0 




/xg/ml 


activity 


at 


>_ 


5. 


0 




/xg/ml 


activity 


at 


_> 


5. 


0 




/xg/ml 


activity 


at 


_> 


5 . 


0 




jxg/ml 


activity 


at 


_> 


5. 


0 




jxg/ml 


activity 


at 


_> 


1 . 


0 




/ig/ml 


activity 


at 


>_ 


1 . 


0 




pig/ml 


activity 


at 


> 




0 . 


5 


[ig/ml 


activity 


at 


> 




0 . 


5 


fig/ml 


activity 


at 


>10. 


0 




/xg/ml 


activity 


at 


>_ 


5. 


0 




/ig/ml 


activity 


at 




5 . 


0 




fig /ml 


activity 


at 


> 


5. 


0 




/ig/ml 


activity 


at 


> 


5. 


0 




/xg/ml 


activity 


at 


>_ 


1 . 


0 




/xg/ml 


activity 


at 


>_ 


1 


0 




/xg/ml 


activity 


at 






0 . 


5 


/xg/ml 


activity 


at 


> 




0 . 


5 


/xg/ml 


activity 


at 


>10 


.0 




/zg/ml 


activity 


at 




5 


.0 




/xg/ml 
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combination improv. : NICHWA 
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activity at > 5 . 0 /xg/ml 
activity at > 5.0 fig/ral 
activity at > 1.0 /xg/ ml 



RE PLACEMENT ANALYSIS- 1 IC 



comb. rep-an. IC 
improved posit ion- 01 
improved position- 02 



combination improv. : YVCHWA 



NICHWA 
YICHWA 
NVCHWA 



activity at > 1 . 0 fig/ml 

activity at >. 0.5 /zg/ml 

activity at > 0.5 /xg/ml 

activity at > 0.1 /xg/ml 



ALIGNMENT combination improv. IIA, I IB and IIC: 



combination improvements 1 

combination improvements 2 

combination improvements 3 
consensus : 



HWASPA 

SPATRY 

YVCHWA 

YVCHWAS S ATR Y 
activity at 0.01 fig/ml 



